Pascal and Francis Bibliographic Databases

Search results

Your search

kw.*:("Reinforcement learning")

Results 1 to 25 of 1087

Reinforcement learning for discounted values often loses the goal in the application to animal learning. YAMAGUCHI, Yoshiya; SAKAI, Yutaka. Neural networks. 2012, Vol 35, pp 88-91, issn 0893-6080, 4 p. Article

Adaptive game AI with dynamic scripting : Machine learning and games. SPRONCK, Pieter; PONSEN, Marc; SPRINKHUIZEN-KUYPER, Ida et al. Machine learning. 2006, Vol 63, Num 3, pp 217-248, issn 0885-6125, 32 p. Article

DistanceRank : An intelligent ranking algorithm for web pages. ALI MOHAMMAD ZAREH BIDOKI; YAZDANI, Nasser. Information processing & management. 2008, Vol 44, Num 2, pp 877-892, issn 0306-4573, 16 p. Article

Graph kernels and Gaussian processes for relational reinforcement learning. DRIESSENS, Kurt; RAMON, Jan; GÄRTNER, Thomas et al. Machine learning. 2006, Vol 64, Num 1-3, pp 91-119, issn 0885-6125, 29 p. Conference Paper

Evidence for learning to learn behavior in normal form games. SALMON, Timothy C. Theory and decision. 2004, Vol 56, Num 4, pp 367-404, issn 0040-5833, 38 p. Article

A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems. LEI ZHENG; CHO, Siu-Yeung. Neural processing letters. 2011, Vol 33, Num 2, pp 187-200, issn 1370-4621, 14 p. Article

The asymptotic equipartition property in reinforcement learning and its relation to return maximization. IWATA, Kazunori; IKEDA, Kazushi; SAKAI, Hideaki et al. Neural networks. 2006, Vol 19, Num 1, pp 62-75, issn 0893-6080, 14 p. Article

The first learning track of the international planning competition. FERN, Alan; KHARDON, Roni; TADEPALLI, Prasad et al. Machine learning. 2011, Vol 84, Num 1-2, pp 81-107, issn 0885-6125, 27 p. Article

Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model. JOHNSON, Adam; REDISH, A. David. Neural networks. 2005, Vol 18, Num 9, pp 1163-1171, issn 0893-6080, 9 p. Article

Feedforward neural networks in reinforcement learning applied to high-dimensional motor control. COULOM, Rémi. Lecture notes in computer science. 2002, pp 403-413, issn 0302-9743, isbn 3-540-00170-0, 11 p. Conference Paper

An Actor-Critic based controller for glucose regulation in type 1 diabetes. DASKALAKI, Elena; DIEM, Peter; MOUGIAKAKOU, Stavroula G et al. Computer methods and programs in biomedicine (Print). 2013, Vol 109, Num 2, pp 116-125, issn 0169-2607, 10 p. Article

Economic impact assessment and operational decision making in emission and transmission constrained electricity markets : Smart Grids. NANDURI, Vishnu; KAZEMZADEH, Narges. Applied energy. 2012, Vol 96, pp 212-221, issn 0306-2619, 10 p. Article

Embedding a priori knowledge in reinforcement learning. RIBEIRO, C. H. C. Journal of intelligent & robotic systems. 1998, Vol 21, Num 1, pp 51-71, issn 0921-0296. Article

Towards a life-long learning soccer agent. KLEINER, Alexander; DIETL, Markus; NEBEL, Bernhard et al. Lecture notes in computer science. 2003, pp 126-134, issn 0302-9743, isbn 3-540-40666-2, 9 p. Conference Paper

Combining exploitation-based and exploration-based approach in reinforcement learning. IWATA, Kazunori; ITO, Nobuhiro; YAMAUCHI, Koichiro et al. Lecture notes in computer science. 2000, pp 326-331, issn 0302-9743, isbn 3-540-41450-9. Conference Paper

Reinforcement learning : Past, present and future. SUTTON, R. S. Lecture notes in computer science. 1999, pp 195-197, issn 0302-9743, isbn 3-540-65907-2. Conference Paper

Finding hidden hierarchy in reinforcement learning. POULTON, Geoff; YING GUO; WEN LU et al. Lecture notes in computer science. 2005, issn 0302-9743, isbn 3-540-28894-5, vol3, 554-561. Conference Paper

Relational reinforcement learning. DRIESSENS, Kurt. Lecture notes in computer science. 2001, pp 271-280, issn 0302-9743, isbn 3-540-42312-5. Conference Paper

Experimental evidence on case-based decision theory. OSSADNIK, Wolfgang; WILMSMANN, Dirk; NIEMANN, Benedikt et al. Theory and decision. 2013, Vol 75, Num 2, pp 211-232, issn 0040-5833, 22 p. Article

Models of trace decay, eligibility for reinforcement, and delay of reinforcement gradients, from exponential to hyperboloid. KILLEEN, Peter R. Behavioural processes. 2011, Vol 87, Num 1, pp 57-63, issn 0376-6357, 7 p. Article

Dissociated roles of the anterior cingulate cortex in reward and conflict processing as revealed by the feedback error-related negativity and N200. BAKER, Travis E; HOLROYD, Clay B. Biological psychology. 2011, Vol 87, Num 1, pp 25-34, issn 0301-0511, 10 p. Article

On the possibility of learning in reactive environments with arbitrary dependence. RYABKO, Daniil; HUTTER, Marcus. Theoretical computer science. 2008, Vol 405, Num 3, pp 274-284, issn 0304-3975, 11 p. Conference Paper

A two-layered multi-agent reinforcement learning model and algorithm : Information technology. WANG, Ben-Nian; YANG GAO; CHEN, Zhao-Qian et al. Journal of network and computer applications. 2007, Vol 30, Num 4, pp 1366-1376, issn 1084-8045, 11 p. Conference Paper

Physiological and behavioral signatures of reflective exploratory choice. OTTO, A. Ross; KNOX, W. Bradley; MARKMAN, Arthur B et al. Cognitive, affective & behavioral neuroscience (Print). 2014, Vol 14, Num 4, pp 1167-1183, issn 1530-7026, 17 p. Article

Socially embedded cognition. HUEBNER, Bryce. Cognitive systems research (Print). 2013, Num 25-26, pp 13-18, issn 2214-4366, 6 p. Article
